Lexical Acquisition with WordNet and the Mikrokosmos Ontology

نویسندگان

  • Thomas P. O'Hara
  • Kavi Mahesh
  • Sergei Nirenburg
چکیده

This paper discusses an approach to augmenting a lexicon for knowledge-based machine translation (KBMT) with information derived from WordNet. The Mikrokosmos project at NMSU's Computing Research Laboratory has concentrated on the creation of the Spanish and Japanese lexicons, so the English lexicon is less developed. We investigated using WordNet as a means to automate portions of the English lexicon development. Several heuristics are used to nd the WordNet synonym sets corresponding to the concepts in the Mikrokos-mos language-independent ontology. Two of these heuristics exploit the WordNet is-a hierarchy: one performs hierarchical matching of both taxonomies, and the other computes similarity based on frequency of deening words and their ancestors in a corpus. The result is a lexicon acquisition tool that produces plausible lexical mappings from English words into the Mikrokosmos ontology. Initial performance results are included, which indicate good accuracy in the mappings.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ontology Development for Machine Translation: Ideology and Methodology

In the Mikrokosmos approach to knowledge-based machine translation, lexical representation of word meanings as well as text meaning representation is grounded in a broad-coverage ontology of the world. We have developed such a language-neutral ontology for the purpose of machine translation in the situation of the Mikrokosmos project. In order to acquire a fairly large ontology with limited tim...

متن کامل

Converting Mikrokosmos Frames into Description Logics

Mikrokosmos contains an ontology plus a number of lexicons in different languages that were originally developed for machine translation. The underlying representation formalism for these resources is an ad-hoc frame-based language which makes it difficult to inter-operate Mikrokosmos with state-ofthe-art knowledge-based systems. In this paper we propose a translation from the frame-based repre...

متن کامل

The Omega Ontology

We present the Omega ontology, a large terminological ontology obtained by reformulating WordNet and Mikrokosmos into a new feature-oriented upper model. We explain the organizing principles of the representation used for Omega and discuss the methodology used to merge the constituent conceptual hierarchies. We survey a range of auxiliary knowledge sources (including instances, verb frame annot...

متن کامل

Towards Semi Automatic Construction of a Lexical Ontology for Persian

Lexical ontologies and semantic lexicons are important resources in natural language processing. They are used in various tasks and applications, especially where semantic processing is evolved such as question answering, machine translation, text understanding, information retrieval and extraction, content management, text summarization, knowledge acquisition and semantic search engines. Altho...

متن کامل

Enriching Ontology Concepts Based on Texts from WWW and Corpus

In spite of the growing of ontological engineering tools, ontology knowledge acquisition remains a highly manual, time-consuming and complex task. Automatic ontology learning is a well-established research field whose goal is to support the semi-automatic construction of ontologies starting from available digital resources (e.g., A corpus, web pages, dictionaries, semi-structured and structured...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998